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ABSTRACT 

We propose an approach for capturing the signal variability in hy- 
perspectral imagery using the framework of the Grassmann mani¬ 
fold. Labeled points from each class are sampled and used to form 
abstract points on the Grassmannian. The resulting points on the 
Grassmannian have representations as orthonormal matrices and as 
such do not reside in Euclidean space in the usual sense. There are 
a variety of metrics which allow us to determine a distance matrices 
that can be used to realize the Grassmannian as an embedding in Eu¬ 
clidean space. We illustrate that we can achieve an approximately 
isometric embedding of the Grassmann manifold using the chordal 
metric while this is not the case with geodesic distances. However, 
non-isometric embeddings generated by using a pseudometric on the 
Grassmannian lead to the best classification results. We observe 
that as the dimension of the Grassmannian grows, the accuracy of 
the classification grows to 100% on two illustrative examples. We 
also observe a decrease in classification rates if the dimension of the 
points on the Grassmannian is too large for the dimension of the Eu¬ 
clidean space. We use sparse support vector machines to perform 
additional model reduction. The resulting classifier selects a subset 
of dimensions of the embedding without loss in classification perfor¬ 
mance. 

Index Terms — Grassmannian classification, Grassmann man¬ 
ifold embedding, multidimensional scaling, sparse support vector 
machines. 

1. INTRODUCTION 

Hyperspectral imagery can be viewed from a data perspective as 
three-way arrays, often referred to as data cubes. The spectral bands 
that make up these cubes can be viewed as high-dimensional color 
spaces extending the three colors of visual photography to n dimen¬ 
sions where n can vary from tens to hundreds depending on the ac¬ 
quisition hardware. A set of k pixels from the same data class can be 
characterized as a fc-dimensional subspace of R". The key feature 
of the set of k pixels is that they capture variation in the class signal 
that is typical of its representation. The central idea of this paper 
is a novel algorithmic framework for capturing the geometry of the 
Grassmannian in Eulcidean space for the purposes of representing 
the variability of information in hyperspectral imagery. 

We propose to encode observations of interest as subspaces. A 
collection of subspaces has a natural mathematical structure known 
as the Grassmann manifold. In mathematics this is referred to as 
an abstract manifold since it does not reside in Euclidean space. 
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i.e., it’s properties are not described by n-tuples whose distances are 
measured via inner products. Recently it has become an active area 
of research to develop computational algorithms on non-Euclidean 
spaces QHH. 

Nash’s famous isometric embedding theorem shows under what 
conditions Riemannian manifolds, Grassmann manifolds are a spe¬ 
cial case of these, can be embedded into Euclidean space such that 
the distances between points on the manifold are preserved m 
There are some interesting counter examples to this, e.g., the polar 
cap of a sphere can not be embedded isometrically into the plane, 
distances between point must be distorted. 

One approach for embedding subspaces into Euclidean space 
is multidimensional scaling (MDS), see, e.g., O and references 
therein. The result is a configuration of points in Euclidean space 
whose distances approximate the distances measured on the abstract 
manifold. One can also establish in certain circumstances that the 
distances are preserved, i.e., the embedding is isometric. 

Geometric approaches have been proposed for characterizing 
data on manifolds, i.e., nonlinear objects that behave locally like Eu¬ 
clidean space. These data driven approaches for manifold learning 
include, e.g., isometric mapping (ISOMAP) 13, local linear embed¬ 
ding (LLE) (6), and Laplacian Eigenmaps Q- A number of practical 
algorithms based on ISOMAP and LLE have been proposed for ap¬ 
plications to hyperspectral imagery, see, e.g., i). We note that in 
these methods, a manifold coordinate system is derived from com¬ 
puting the geodesic distances between the hyperspectral pixels, i.e., 
they are algorithms operating in pixel space. The algorithms applied 
to pixel space are using manifolds as a model for the data. In our ap¬ 
proach, we first encode sets of pixels as subspaces which are viewed 
as points on a Grassmann manifold, the existence of which is the¬ 
oretically guaranteed. The Grassmann manifold is then embedded 
into Euclidean space using MDS. As we will describe below, this 
setting then permits the simultaneous classification and dimension 
selection performed in the Euclidean space. 

The outline of this paper is as follows. In Section|^we describe 
the mathematical framework behind encoding collections of pixels 
as subspaces using the geometry of the Grassmann manifold. In 
Sectionl^we outline the methodology for approximating an isomet¬ 
ric embedding of the Grassmannian. The algorithm is summarized 
in Section |4] and the computational results discussed in Section |3 
We summarize our findings in Section[3 

2. FRAMEWORK 

The real Grassmann manifold (Grassmannian) G{k, n) is the col¬ 
lection of all fc-dimensional subspaces of R", for fixed k < n. 
It has dimension k{n — k) where in our case n is the number of 
wavelengths (spectral bands) and fc is the number of pixels (taken 
from the same class) to be used to define the subspaces. Points on 
G(fc, n) are represented by orthonormal matrices. These points are 
actually equivalence classes in the sense that any two bases which 



have the same span are viewed as the same point on the Grassman- 
nian. The Grassmannian characterizes fc-dimensional subspaces of 
71 -dimensional Euclidean space as a quotient space of the orthogonal 
group, G{k,n) = o(k)^C)(n-k) ■ Note that this is in contrast to other 
methods for representing subspaces, e.g., Stiefel manifolds O. 

The power of the Grassmann manifold lies in its ability to pro¬ 
vide a geometric framework for characterizing the relationship be¬ 
tween subpaces. In particular, there is a rich geometric structure that 
permits a variety of metrics to be employed. We briefly summarize 
the basic facts here; for additional details see ( 9 ). 

The Grassmann manifold is endowed with a Riemannian struc¬ 
ture which affords the computation of distances between the points 
on the manifold using, e.g., geodesics (i.e. curves of the shortest 
length). There are several different definitions of the distance on 
G{k, n) used in the literature. The geodesic and chordal distances 
Qo) between two subspaces Vi and 7^2 are given by 

k 

dg(iPi.iP2) = (;^e?)'/" = ii0ii2 

i=l 

and 

k 

dc{Vi,'P2) = (^(sin6»i)^)^^^ = ||sin6l||2, 

i=l 

respectively, where 6 = [61,62, ■■■ ,0k] (0 < 61 < 62 < ... < 
dk < 7 r/ 2 ) is the vector of the principal vectors between the sub¬ 
spaces Vi and 7^2. Note that principal angles are computed using 
SVD-based algorithm applied to the matrices Pi and P2 that are or¬ 
thonormal bases for the subspaces Vi and 'P2, respectively lim . We 
also define a pseudometric given by dp = 61 (the smallest principal 
angle) to be used in our computations; see also i). 

There are several options for determining an abstract point of 
the Grassmannian from data assembled as columns of a matrix Y £ 
jjnxfe -pjjg most inexpensive is the Gram-Schmidt algorithm for 
constructing a QR decomposition. The thin Singular Value Decom¬ 
position (SVD) of y = provides a characterization of data 

in terms of the nxk orthonormal matrix U. Given the column space 
of y, TZ{Y), is spanned by the columns of U, for computational pur¬ 
poses we can choose U to be a point on the Grassmannian. 

Suppose we constructed p points on G{k, n), Ui, U2, ..., Up, 
each associated with a set of pixels from the same class, using the 
method described above. We can now obtain a symmetric distance 
matrix D € with Du = 0 and Dij > 0 being equal to one of 

the distances between Ui and Uj. 

3 . EMBEDDING THE GRASSMANN MANIEOLD INTO 
EUCLIDEAN SPACE 

In this section we describe how to realize a configuration of a collec¬ 
tion of fc-dimensional subspaces as an embedding d-tuples where the 
appropriate size of the Euclidean space d is to be determined from 
the data. 

We assume that we have obtained a collection of abstract points 
on the Grassmannian G{k, n). These points are generated by sub¬ 
spaces of the sampled data. In practice, n is fixed to be the number 
of spectral bands and k can vary as we change the size of our col¬ 
lection of similar points. Although one could consider subspaces of 
different dimension, we assume in this paper that all points are on 
the same Grassmannian. 

Given a collection of points on G{k, n), the next step is to pro¬ 
duce a configuration of points in Euclidean space (of some dimen¬ 
sion to be determined) that faithfully captures the interpoint dis¬ 


tances, i.e., an isometric embedding. An attractive tool for accom¬ 
plishing this is multidimensional scaling (MDS). 

MDS is able to produce a configuration of points in a Euclidean 
space that captures the (possibly non-Euclidean) distances of the 
original data in an optimal manner. It is worth emphasizing the we 
are measuring distances in the embedded Euclidean space as usual, 
but that they are approximating distances that were measured on a 
manifold. Note again, that this is not always possible - spheres 
can’t be isometrically embedded in planes. Given a distance matrix 
D £ R^’^^’, the basic steps of MDS are: 

1 . Compute B = P[AH, where H = I — and Aij = 
— ^Dij (e is a vector of p ones). 

2 . Compute the spectral decomposition B — TAT^. 

3 . SetX TA^ 

A is a configuration of points in R*^, where d — rank{B) = 
rank{X) < p — 1 . If 73 is positive semi-definite, then there exists 
a configuration of points in the Euclidean space associated with the 
positive eigenvalues whose interpoint distances are given by D 0 
In practice, in the case when 73 is not positive semi-definite (i.e., 
negative eigenvalues occur), the Euclidean space associated with the 
positive eigenvalues provides the optimal approximation of the con¬ 
figuration of points. 

4 . ALGORITHM 

To obtain a collection of abstract points on the Grassmannian 
G{k, n), we generate subspaces for a chosen dimension fc; in prac¬ 
tice we look at the effect of the choice of k on the classification 
results. We begin from sampling with replacement subsets of the 
data points, forming nxk ’tall-skinny’ matrices y. Note that each 
y contains only points from the same class. We also retain the 
appropriate classification labels for these matrices as they will be 
needed in classification step. Then, as described in Section we 
select Ui to be an orthonormal matrix that spans TZ{Yi). 

Once the points on the Grassmannian have been embedded in a 
Euclidean space of appropriate dimension d (Section^, we employ 
a linear sparse support vector machine (SSVM) EDO to perform 
classification in the Euclidean space. An SSVM, or £i-nonr[j SVM, 
has an attractive feature of being able to reduce the number of vari¬ 
ables, i.e., the dimension of the data, as determined by the non-zero 
weights in the ^i-norm optimization problem. In our case, the SSVM 
reduces the dimension of the embedding: non-zero weights in the 
decision function indicate optimal dimensions of the embedding out 
of d dimensions determined by the number of positive eigenvalues 
of 73 . These optimal dimensions can be used for improving clas¬ 
sification rates and embedding visualization. In this paper, we use 
a pre-defined tolerance to determine non-zero SSVM weights that 
indicate the optimal dimensions of the embedding. 

As noted in Section[T] our approach differs from spectral meth¬ 
ods such as ISOMAP or LEE, that derive a manifold coordinate sys¬ 
tem for original data sets. In our approach, the data points are en¬ 
coded as subspaces on a Grassmann manifold, and then embedded 
into Euclidean space optimally preserving distances between them. 
We perform the basic steps in the following order: original data in 
R"' —)• points on G{k, n) —^ embedding in R'* —>• SSVM classifica¬ 
tion and dimension selection. 

The algorithm below summarizes the proposed approach with 
more details: 

*Forx G R", ||x||i = JDILi \^i\ 




Algorithm: Classification on Grassmannians 


Input: A hyperspectral data matrix X € whose columns 

are pixels with class labels di,i = 1 ,... ,m, subspace dimension k, 
number of points p to be sampled on G{k, n). 

1. Compute p points Ui on G{k, n) by finding orthonormal ma¬ 
trices that span TZ{Yi). 

2. Compute the pairwise distances between points on G{k, n). 

3. Implement the MDS algorithm to obtain a configuration of 
points in dimension d, where d is the number of positive 
eigenvalues. 

4. Train an SSVM to obtain classification accuracy and optimal 
dimensions in the Euclidean space. 

Output: Embedding of G{k, n) into R'* and an SSVM model in the 
Euclidean space; subset of SSVM selected optimal dimensions. 

5. COMPUTATIONAL RESULTS 

We illustrate the algorithm and the mathematical framework using 
the well-known hyperspectral AVIRIS Indian Pines data se0. This 
data was collected by the Aibome Visible/Infrared Imaging Spec¬ 
trometer (AVIRIS) over a small agricultural area in Northwestern 
Indiana in 1992. It consists of 145 x 145 pixels by 220 bands from 
0.4 to 2.4/im. Due to availability of the ground truth, 10366 pixels 
were prelabeled to be in one of the 16 classes. The unlabeled back¬ 
ground pixels were not used in our experiments. To limit the scope of 
our paper, we consider two two-class problems and one three-class 
classification problem. We preprocessed the data by mean center¬ 
ing, i.e., the mean of the data was subtracted from each pixel in the 
scene. The data was randomly partitioned into 50% for training and 
50% for testing. Eor this data, the original input space dimension is 
n = 220. 

In a typical 2-class experiment, to derive an embedding for a 
chosen dimension k, we assemble 100 n x k matrices Yi from each 
class, which results in constructing p = 200 points Ui on G{k, n). 
This set of points contains equal numbers of training and testing sub¬ 
spaces. Following the steps of the Algorithm, we obtain a distance 
matrix D € R^^^, and then apply MDS to embed the points, keep¬ 
ing track of the class labels of the subspaces. 

Figures [T] and illustrate Euclidean embedding configurations 
obtained for various k using the geodesic distance dg for the two 
classes Corn-notill and Grass/Pasture and three classes Corn-notill, 
Grass/Pasture, and Grass/Trees, respectively. Note that for the visu¬ 
alization purpose, the chosen two dimensions correspond to the two 
top eigenvalues of the matrix B (MDS). We see the class separation 
becoming stronger as we increase the dimension k of the Grassman¬ 
nians. 

Table [T] contains accuracy rates and dimension selection results 
in the embedded spaces for the two classes Corn-notill and Grass- 
Pasture, and Tablel^contains results for classes Soybeans-notill and 
Soybeans-min. The former pair of classes has low classification dif¬ 
ficulty in comparison to the latter. The Soybean-notill and Soybeans- 
min classes were selected, because, to our knowledge, no-one has 
ever achieved 100% classification accuracy on them. Note that the 
results are averaged over 10 runs. For each k, we obtain a configura¬ 
tion of points in a new embedded space of dimension d, which cor¬ 
responds to the number of positive eigenvalues of the matrix B. We 
compare results obtained on embedded Grassmannians using two 

^This dataset can be downloaded from 

http:/dynamo.ecn.purdue.edu/'biehl/MultiSpec. 
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Fig. I: Configuration of points on G{k, 220) embedded in Euclidean 
space using dg for 2 classes: Corn-notill (circles) and Grass/Pasture 
(plus signs). Dimensions correspond to the two top eigenvalues of 
B (MDS). 



Fig. 2: Configuration of points on G{k, 220) embedded in Euclidean 
space using dg for 3 classes: Com-notill (circles), Grass/Pasture 
(plus signs), and Grass/Trees (triangles). Dimensions correspond to 
the two top eigenvalues of B (MDS). 


distance metrics, chordal dc and geodesic dg, and the pseudometric 
dp defined as the smallest principal angle between subspaces. Note 
that in case of the classes Com-notill and Grass-Pasture, they are 
separated starting fc = 10 for all three choices of a metric. The best 
accuracy result we could obtain on the original data using SSVM 
was 99.27%, while the Grassmannian framework allows one to im¬ 
prove the result up to 100%. The number of dimensions selected 
by SSVM is very small for both chordal and pseudometric. As for 
the higher difficulty classification problem, namely, Soybeans-notill 
and Soybeans-min separation, we also have improved the accuracy 
results using our framework. SSVM, applied to the original data, has 
89.57%, which is less than some accuracy results obtained using dif¬ 
ferent metrics on the embedded Grassmannians. We observe that the 
performance of the pseudometric degrades from perfect to random 
as the number of dimensions is increase beyond the optimal number. 
This is a consequence of the fact that there are more dimensions in 






































Table 1: Accuracy results and feature selection in embedded spaces for p = 200 points on G{k, 220), Corn-notill versus Grass-Pasture, 
averaged over 10 runs: 


Dimension of subspaces k 

Number of negative eigenvalues of B 

SSVM Accuracy (%) 

Number of dimensions selected 

Chordal 

Geodesic 

Pseudo 

Chordal 

Geodesic 

Pseudo 

Chordal 

Geodesic 

Pseudo 

1 

13.5 

68.1 

69.5 

80.6 

85.4 

85.3 

24.4 

41.5 

36.7 

5 

0 

52 

94.3 

96.6 

96 

99.4 

146 

123.4 

23.7 

10 

0 

5.6 

96.6 

99.7 

100 

100 

150.4 

143.5 

15.3 

15 

0 

0 

98.2 

100 

99.8 

100 

160.6 

172 

7.6 

20 

0 

0 

99.6 

100 

100 

100 

161.1 

164.9 

5.2 

25 

0 

0 

99.7 

100 

98.2 

100 

162.7 

171.9 

5.4 


Table 2: Accuracy results and feature selection in embedded spaces for p — 200 points on G{k, 220), Soybeans-notill and Soybeans-min, 
averaged over 10 runs: 


Dimension of subspaces k 

Number of negative eigenvalues of B 

SSVM Accuracy (%) 

Number of dimensions selected 

Chordal 

Geodesic 

Pseudo 

Chordal 

Geodesic 

Pseudo 

Chordal 

Geodesic 

Pseudo 

1 

8.2 

62.4 

63.3 

68 

68.1 

66.7 

41.6 

40.1 

49 

5 

0 

50.6 

90.2 

67.6 

62.6 

90.6 

175.6 

138.9 

31.7 

10 

0 

3.9 

92.9 

86.7 

74.3 

99.9 

181.7 

176.8 

18.4 

15 

0 

0 

94.5 

94.2 

86.6 

100 

175.8 

179.3 

10.5 

20 

0 

0 

96.2 

99 

92 

95 

169.5 

181.2 

4.9 

25 

0 

0 

97.3 

93.7 

92.8 

50 

166.3 

172 

3.3 


which to find a smallest single angle. 

6. CONCLUSIONS 

We propose an algorithm for analyzing hyperspectral imagery us¬ 
ing the geometric framework of the Grassmann manifold. Sets of 
same class pixels are encoded as points on Grassmann manifold and 
subsequently embedded as a configuration in Euclidean space using 
multidimensional scaling. A sparse support vector machine algo¬ 
rithm is then able to further reduce the dimension of the classifier in 
Euclidean space by selecting only MDS dimensions most useful for 
classification. 

The results confirm the assertion that the geodesic distance is in¬ 
ferior to the chordal metric for applications of the Grassmann frame¬ 
work Go). However, we have observed that the use of a smallest sin¬ 
gle angle pseudo-metric improves classification accuracy even fur¬ 
ther. The accuracies on the two class test problems grew to 100% on 
the test data as we increased the dimension of the Grassmannian but 
only the pseudometric obtained 100% classification on the challeng¬ 
ing soybean data. The SSVM also provided additional insight into 
the pseudometric showing that far fewer dimensions are required by 
the classifier. 
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